3574 results found.
Speech
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Data Center(s)
License:
LDC
Size:
9.3 GByte Production Status:
Existing-used
Use:
Speech Recognition/Understanding
-
Paper title:NIESR: Nuisance Invariant End-to-end Speech Recognition
-
Paper track:8.3 Robustness against noise or reverberation/Oral Presentation
-
Paper status:Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | I-Hung Hsu | CSR-I (WSJ0) Sennheiser | /N |
Documentation:
None
Speech
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
None Production Status:
Existing-used
Use:
Person Identification
-
Paper title:Optimizing a Speaker Embedding Extractor Through Backend-Driven Regularization
-
Paper track:4.3 Speaker verification and identification/Oral Presentation
-
Paper status:Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Luciana Ferrer | FVC Australian | /N |
Documentation:
Paper
Speech
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
CreativeCommons
Size:
None Production Status:
Existing-used
Use:
Person Identification
-
Paper title:Optimizing a Speaker Embedding Extractor Through Backend-Driven Regularization
-
Paper track:4.3 Speaker verification and identification/Oral Presentation
-
Paper status:Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Luciana Ferrer | Speakers in the wild | /N |
Documentation:
Paper
Speech
Corpus,
Language Type:
Bilingual
Languages:
English Many others
Availability:
From Data Center(s)
License:
LDC
Size:
None Production Status:
Existing-used
Use:
Person Identification
-
Paper title:Optimizing a Speaker Embedding Extractor Through Backend-Driven Regularization
-
Paper track:4.3 Speaker verification and identification/Oral Presentation
-
Paper status:Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Luciana Ferrer | NIST SRE Evaluations | /N |
Documentation:
None
Speech
Corpus,
Language Type:
Multilingual
Languages:
Arabic English Korean Spanish
Availability:
Not Available
License:
Size:
None Production Status:
Existing-used
Use:
Person Identification
-
Paper title:Optimizing a Speaker Embedding Extractor Through Backend-Driven Regularization
-
Paper track:4.3 Speaker verification and identification/Oral Presentation
-
Paper status:Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Luciana Ferrer | LASRS | /N |
Documentation:
Paper
Speech/Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
CC BY 4.0
Size:
1000 hours Production Status:
Existing-used
Use:
Speech Recognition/Understanding
-
Paper title:Jasper: An End-to-End Convolutional Neural Acoustic Model
-
Paper track:8.5 Novel neural network architectures (e.g. seque/Oral Presentation
-
Paper status:Accept - Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Vitaly Lavrukhin | LibriSpeech | /N |
Documentation:
None
Speech
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
2042 MByte Production Status:
Existing-used
Use:
Machine Learning
-
Paper title:Speech Enhancement with Variance Constrained Autoencoders
-
Paper track:6.3 Noise reduction for speech signals/Oral Presentation
-
Paper status:Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Daniel Braithwaite | Noisy speech database for training speech enhancement algorithms and TTS models | /N |
Documentation:
None
Multimodal/Multimedia
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Attribution-NonCommercial 4.0 Generic (CC BY-NC 4.0)
Size:
380 GByte Production Status:
Existing-used
Use:
Machine Learning
-
Paper title:Synchronising audio and ultrasound by learning cross-modal embeddings
-
Paper track:10.4 Speech science in end-user applications/Oral Presentation
-
Paper status:Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Aciel Eshky | UltraSuite | /N |
Documentation:
https://ultrasuite.github.io/
Speech/Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
CC BY-NC 4.0
Size:
5.7 GByte Production Status:
Existing-updated
Use:
Speech Synthesis
-
Paper title:Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams
-
Paper track:7.12 Voice modification, conversion and morphing/Poster Presentation
-
Paper status:Accept - Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Guanlong Zhao | L2-ARCTIC corpus | /N |
Documentation:
https://psi.engr.tamu.edu/l2-arctic-corpus-docs/
Speech/Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
None Production Status:
Existing-used
Use:
Speech Synthesis
-
Paper title:Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams
-
Paper track:7.12 Voice modification, conversion and morphing/Poster Presentation
-
Paper status:Accept - Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Guanlong Zhao | CMU ARCTIC database | /N |
Documentation:
http://festvox.org/cmu_arctic/cmu_arctic_report.pdf




